Two Regimes in the Frequency of Words and the Origins of Complex Lexicons: Zipf's Law Revisited

نویسندگان

  • Ramon Ferrer-i-Cancho
  • Ricard V. Solé
چکیده

SFI WORKING PAPER: 2000-12-068 SFI Working Papers contain accounts of scientific work of the author(s) and do not necessarily represent the views of the Santa Fe Institute. We accept papers intended for publication in peer-reviewed journals or proceedings volumes, but not papers that have already appeared in print. Except for papers by our external faculty, papers must be based on work done at SFI, inspired by an invited visit to or collaboration at SFI, or funded by an SFI grant. ©NOTICE: This working paper is included by permission of the contributing author(s) as a means to ensure timely distribution of the scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the author(s). It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may be reposted only with the explicit permission of the copyright holder. www.santafe.edu

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MHSubLex: Using Metaheuristic Methods for Subjectivity Classification of Microblogs

In Web 2.0, people are free to share their experiences, views, and opinions. One of the problems that arises in web 2.0 is the sentiment analysis of texts produced by users in outlets such as Twitter. One of main the tasks of sentiment analysis is subjectivity classification. Our aim is to classify the subjectivity of Tweets. To this end, we create subjectivity lexicons in which the words into ...

متن کامل

Beyond the Zipf-Mandelbrot law in quantitative linguistics

In this paper the Zipf-Mandelbrot law is revisited in the context of linguistics. Despite its widespread popularity the Zipf–Mandelbrot law can only describe the statistical behaviour of a rather restricted fraction of the total number of words contained in some given corpus. In particular, we focus our attention on the important deviations that become statistically relevant as larger corpora a...

متن کامل

Comments to "Bell Curves and Monkey Languages", J. Casti, Complexity, 1, 12-15 1995.

Whether there are universal laws or principles in complex systems is a fascinating and important question. Prof. John Casti uses the case of Normal Distribution (\bell curves") to illustrate that such universal principle is perhaps out there waiting to be discovered [1]. He suggests Zipf's law as a candidate for such universal principle. But as the author of one of the three publications to pro...

متن کامل

Comments on \Bell curves and monkey languages" (letter to the editor

Whether there are universal laws or principles in complex systems is a fascinating and important question. Prof. John Casti uses the case of Normal Distribution (\bell curves") to illustrate that such universal principle is perhaps out there waiting to be discovered [1]. He suggests Zipf's law as a candidate for such universal principle. But as the author of one of the three publications to pro...

متن کامل

The origins of Zipf's meaning-frequency law

In his pioneering research, G. K. Zipf observed that more frequent words tend to have more meanings, and showed that the number of meanings of a word grows as the square root of its frequency. He derived this relationship from two assumptions: that words follow Zipf’s law for word frequencies (a power law dependency between frequency and rank) and Zipf’s law of meaning distribution (a power law...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of Quantitative Linguistics

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2001